Choice of parallelism: multi-GPU driven pipeline for huge academic backbone network
نویسندگان
چکیده
Science Information Network (SINET) is a Japanese academic backbone network for more than 800 research institutions and universities. In this paper, we present multi-GPU-driven pipeline handling huge session data of SINET. Our consists ELK stack, multi-GPU server, Splunk. A server responsible two procedures: discrimination histogramming. Discrimination dividing into ingoing/outgoing with subnet mask calculation address matching. Histogramming grouping bins map-reduce. our architecture, use GPU the acceleration ingress/egress data. Also, tiling design pattern building two-stage map-reduce CPU GPU. has succeeded in processing workloads about 1.2–1.6 billion streams (500–650 GB) within 24 hours.
منابع مشابه
Synkhronos: a Multi-GPU Theano Extension for Data Parallelism
We present Synkhronos, an extension to Theano for multi-GPU computations leveraging data parallelism. Our framework provides automated execution and synchronization across devices, allowing users to continue to write serial programs without risk of race conditions. The NVIDIA Collective Communication Library is used for high-bandwidth inter-GPU communication. Further enhancements to the Theano ...
متن کاملMulti-level parallelism for incompressible flow computations on GPU clusters
We investigate multi-level parallelism on GPU clusters with MPI-CUDA and hybrid MPI-OpenMP-CUDA parallel implementations, in which all computations are done on the GPU using CUDA. We explore efficiency and scalability of incompressible flow computations using up to 256 GPUs on a problem with approximately 17.2 billion cells. Our work addresses some of the unique issues faced when merging fine-g...
متن کاملNew Directions for a Japanese Academic Backbone Network
This paper describes an architectural design and related services of a new Japanese academic backbone network, called SINET5, which will be launched in April 2016. The network will cover all 47 prefectures with 100-Gigabit Ethernet technology and connect each pair of prefectures with a minimized latency. This will enable users to leverage evolving cloud-computing powers as well as draw on a hig...
متن کاملMulti-objective exploitation of pipeline parallelism using clustering, replication and duplication in embedded multi-core systems
With the popularity of mobile device, people require more computing power to run emerging applications. However, the increase in power consumption is a major problem because power is quite limited in embedded systems. Our goal is to consider power consumption along with latency and throughput. We proposed a heuristic algorithm, called Parallel Pipeline Latency Optimization for high performance ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Parallel, Emergent and Distributed Systems
سال: 2021
ISSN: ['1744-5779', '1744-5760']
DOI: https://doi.org/10.1080/17445760.2021.1941009